Using Latent Semantic Analysis to Aid Speech Recognition and Understanding
نویسنده
چکیده
Generally, speech recognition engines can employ two different grammar methods, rule and dictation, to recognize an utterance. The purpose of these grammars is to constrain the search space in a way that anticipates the speaker’s utterance. The research described in this paper attempts to maintain the accuracy of a rule grammar without limiting the speaker to rigorous phraseology. Latent Semantic Analysis (LSA) is used to connect specific grammar rules with the meanings underlying matching phrases resulting in utterances being matched to knowledge base elements even though the exact phrase did not match any grammar rule. A separate knowledge base is used to dynamically add or remove grammar rules in the speech recognition engine as the conversation context changes. Finally, a learning technique is used to create new regular expressions based on utterances that matched semantically through LSA.
منابع مشابه
Latent Semantic Analysis based Language Models for Meetings
Language models that combine N -gram models with Latent Semantic Analysis (LSA) based models have been successfully applied for conversational speech recognition [3] and for the Wall Street Journal recognition task [1]. LSA defines a semantic similarity space using a training corpus. This semantic similarity can be used for dealing with long distance dependencies, which are an inherent problem ...
متن کاملRanking of Document Recommendations from Conversations using Probabilistic Latent Semantic Analysis
Any Information retrieval from documents is done through text search. Now a day, efficient search is done through Mining techniques. Speech is recognized for searching a document. A group of Conversations are recorded using Automatic Speech Recognition (ASR) technique. The system changes speech to text using FISHER tool. Those conversations are stored in a database. Formulation of Implicit Quer...
متن کاملSpontaneous Mandarin Speech Recognition with Disfluencies Detected by Latent Prosodic Modeling (LPM)
In this paper, a new approach for improved spontaneous Mandarin speech recognition using Latent Prosodic Modeling (LPM) for disfluency interruption point (IP) detection is presented. The basic idea is to detect the disfluency interruption points (IPs) prior to the recognition, and then to incorporate these information into the recognition process via the second pass rescoring. For accurate dete...
متن کاملPerformance Evaluation of WordNet-based Semantic Relatedness Measures for Word Prediction in Conversational Speech
The recognition of conversational speech is a hard problem. Semantic relatedness measures can improve speech recognition performance when using contextual information, as Demetriou [5] has shown. The standard n-gram approach in language modeling for speech recognition cannot cope with long distance dependencies [4]. Therefore J. Bellegarda [2] proposed combining n-gram language models, which ar...
متن کاملLatent prosodic modeling (LPM) for speech with applications in recognizing spontaneous Mandarin speech with disfluencies
In this paper, a new approach of Latent Prosodic Modeling (LPM) for analyzing the prosody of speech is presented. Based on a set of newly defined prosodic characters, prosodic terms, documents, and the Probabilistic Latent Semantic Analysis (PLSA) framework, prosody can be modeled using a set of prosodic states representing various latent factors such as speakers, speaking rate, utterance modal...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003